NOVEL INDOLE DERIVATIVE FOR ALKYLATING SPECIFIC BASE 
SEQUENCE OF DNA AND ALKYLATING AGENT AND DRUG CONTAINING THE 

DERIVATIVE 

This is a national stage application under 35 U.S.C. § 
371 of PCT/ JP2005/004250 filed on March 10, 2005, which 
claims priority from Japanese patent application 2004-114793 
filed on March 13, 2004, all of which are incorporated 
herein by reference. 

Technical Field 

The present invention relates to a novel indole 
derivative for alkylating a specific base sequence of 
deoxyribonucleic acid (DNA) . The present invention also 
relates to an alkylating agent and a drug that contain the 
indole derivative. Specifically, the present invention 
relates to a novel indole derivative that particularly 
suppresses the occurrence of genetic abnormalities and has 
high anticancer activity and significantly high reactivity. 
The present invention also relates to an alkylating agent 
and a drug that contain the indole derivative. 
Background Art 

Now that the elucidation of the gene arrangement of 
humans has been substantially completed, many researchers 
have focused on a molecule having the ability to 
specifically bind to a particular base sequence. Non-Patent 
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Document 1 and the like show that Dervan et al . found that 
an antiparallel oriented "N-methy lpyrr ole " - "N- 
methyl imidazole " poly amide (hereinafter, N-methy lpyrr ole 
being simply referred to as "pyrrole" or "Py"; and N- 
methylimidazole being simply referred to as "imidazole" or 
"Im") base-sequence-specif ically binds to the minor groove 
of DNA. Furthermore, general rules that Py-Im recognizes a 
C-G base pair, Im-Py recognizes a G-C base pair, and Py-Py 
recognizes an A-T base pair or a T-A base pair have been 
derived . 

Various hairpin polyamides and cyclic polyamides have 
been synthesized by introducing a covalent bond into Py-Im 
in order to prevent the loss of entropy due to binding and 
to achieve stronger binding and the ability to recognize a 
sequence. A P-linker ( -NHCH 2 CH 2 CO- ) and a y-linker (- 
NHCH2CH2CH2CO- ) are typically known. It has been described 
that a hairpin polyamide containing the y-linker (- 
NHCH2CH2CH2CO- ) particularly has excellent binding ability 
and recognition ability. In addition, the structure of a 
complex between the hairpin polyamide containing the y- 
linker and DNA has been determined (see Non-Patent Document 
2). P~P recognizes an A-T base pair or a T-A base pair. 
Similarly, the y-linker recognizes an A-T base pair or a T-A 
base pair. The y-linker has a y-turn structure represented 
by the trifurcate portion of a bent hairpin model. 
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These molecules each have an association constant and 
specificity comparable to a transcription factor and the 
like. However, the regulation of gene expression is 
performed by inhibiting binding of the transcription factor. 
Thus, base sequences that can be targeted is significantly 
limited (for example, see Non-Patent Document 3). 

The inventors have developed a hybrid molecule of a Py- 
Im polyamide bound to a segment A (Du) serving as an 
alkylating moiety of duocarmycin A, which is an antibiotic, 
and filed an application (for example, Patent Document 1). 
The hybrid molecule selectively alkylated one site of a DNA 
fragment of 450 base pairs on the basis of the sequence 
recognition ability of the Py-Im polyamide (for example, see 
Non-Patent Document 4). However, completion of the reaction 
requires several days. Furthermore, the reaction efficiency 
was as low as several percent. 

In contrast, the inventors found that ImPyLDu86 
containing a vinyl linker (L) disposed between a functional 
group (hereinafter, referred to as an "alkylating moiety") 
for alkylating DNA and the Py-Im polyamide dimerizes and 
selectively reacts with both chains at sites thereof located 
5 base pairs apart in a 5 1 -YG ( A/T ) CR-3 1 sequence (wherein Y 
represents a pyrimidine base; and R represents a purine 
base) at a low concentration to cause alkylation with an 
efficiency as high as 70%. That is, the inventors found 
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considerable improvement in reactivity and efficiency by 
introducing the linker moiety (for example, see Non-Patent 
Document 5) . 

Patent Document 1: International Publication No. 
WO00/15641 

Non-Patent Document 1: J. Am. Chem. Soc. 1997, 119, 

7636 

Non-Patent Document 2: J. Am. Chem. Soc. 1997, 119, 

7909 

Non-Patent Document 3: J. Am. Chem. Soc. 2000, 122, 

4856 

Non-Patent Document 4: J. Am. Chem. Soc. 1999, 121, 

4961 

Non-Patent Document 5: J. Am. Chem. Soc. 2000, 122, 

1602 

Disclosure of Invention 

Problems to be Solved by the Invention 

However, the hybrid molecule described in Non-Patent 
Document 5 has problems related to difficulty of synthesis, 
the lability of the linker moiety, the reactivity of the 
alkylating moiety, and the like. 

Accordingly, it is an object of the present invention 
to provide a novel pyrrole-imidazole polyamide compound for 
alkylating the specific base sequence of DNA, the polyamide 
compound being capable of being synthesized through fewer 
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reaction steps than known hybrid molecules and having a 
combination of a high reactivity in DNA alkylation and the 
ability to recognize a sequence. Furthermore, it is an 
object of the present invention to provide an alkylating 
agent and a molecule serving as a drug, the alkylating agent 
and the molecule containing the polyamide compound. 
Means for Solving the Problems 

The inventors have conducted intensive studies and 
found that the use of a pyrrole-imidazole polyamide compound 
having an alkylating moiety at a terminus thereof via an 
indole linker solved the problems described above. This 
finding resulted in completion of the present invention. 

That is, an indole derivative of the present invention 
is represented by general formula (1): 




(i) 

wherein R 1 represents a functional group for alkylating DNA; 
R 2 represents a hydrogen atom, an alkyl group, or an acyl 
group; and X represents a divalent group having one 
constitutional unit or having two or more constitutional 
units which may be the same or different, the constitutional 
unit being represented by the following formula: 
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(wherein m is an integer of 0 to 10), wherein among the 
constitutional units, a terminal constitutional unit 
adjacent to R 2 may be a constitutional unit represented by 
the following formula: 




(wherein k is an integer of 0 to 10) . 

An alkylating agent of the present invention for 
alkylating DNA is composed of the indole derivative of the 
present invention described above. 

An inventive drug contains the alkylating agent of the 
present invention and suppresses or activates the expression 
of a gene. 
Advantages 

An indole derivative of the present invention is a 
functional molecule selectively alkylating a specific base 
sequence present in a gene. The indole derivative can 
change a target base sequence by changing the configuration 
of imidazole, pyrrole, and the like in its molecule. 
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Furthermore, a method for synthesizing the indole derivative 
is practical and highly applicable in view of the production 
of a wide variety of derivatives. Therefore, it is possible 
to logically design a drug on a genetic level, the drug 
being applicable to an important gene arrangement or an 
abnormal gene derived from a disease, such as cancer, in the 
human genome . 

Brief Description of the Drawings 

Fig. 1 shows abilities of compounds (4-1) and (3-1) of 
the present invention to DNA-sequence-specif ically alkylate 
a long-chain DNA (pUC-II and pUC-II 1 ); and a sequence 
recognition model thereof. 

Fig. 2 shows the abilities of compounds (3-2) and (3-3) 
of the present invention to DNA-sequence-specif ically 
alkylate a long-chain DNA (pUC-II and pUC-I 1 ) and sequence 
recognition models thereof. 

Fig. 3 shows a model of base-sequence-specific 
alkylation of DNA by compound (3-2) of the present invention. 

Fig. 4 shows abilities of compounds (3-2), (3-4), and 
(3-5) of the present invention to DNA-sequence-specif ically 
alkylate a long-chain DNA (?l-F10906); and sequence 
recognition models thereof. 

Fig. 5 shows the DNA-sequence-specif ic alkylation of a 
long-chain DNA (pQBl63) by compound (3-6) of the present 
invention . 
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Fig. 6 shows stability of compounds (4-1) and (3-1) of 
the present invention under acidic and alkaline conditions. 

Fig. 7 shows DNA-sequence-specif ic alkylation of a 
long-chain DNA (727 bp) by compound (3-7) alone or the 
combination of compounds (3-7) and (6) of the present 
invention . 

Fig. 8 shows telomere-sequence-specif ic alkylation of a 
DNA fragment (446 bp) having 24 repeats of a telomeric 
sequence by compound (4-3) of the present invention. 

Fig. 9 is a bar graph showing the anticancer activity 
of inventive compound (3-2) resulting from evaluation on 39 
cultured human cancer cell lines, such as breast cancer cell 
line and prostate cancer cell line. 
Best Mode for Carrying Out the Invention 

Preferred embodiments of the present invention will be 
described in detail below. 

An indole derivative of the present invention is 
represented by general formula (1): 




a) 

wherein X represents a 
constitutional unit or 
units which may be the 



divalent group having one 

having two or more constitutional 

same or different, the constitutional 
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unit being represented by the following formula: 




wherein among the constitutional units, a terminal 
constitutional unit adjacent to R 2 may be a constitutional 
unit represented by the following formula: 




wherein m and k are each an integer of 0 to 10, preferably 1 
to 6, and more preferably 2 to 3 . Furthermore, the number 
of constitutional units is not particularly limited but 
appropriately selected depending on a DNA sequence targeted 
for alkylation. Usually, the number of constitutional units 
is preferably 1 to 20 and more preferably 2 to 7. The 
sequence of the constitutional units is not particularly 
limited but appropriately selected depending on a DNA 
sequence targeted for alkylation to design a polyamide. 

A DNA sequence recognized by the indole derivative of 
the present invention complies with the known general rule 
in principle. That is, Py-Im recognizes a C-G base pair; 
Im-Py recognizes a G-C base pair; Py-Py recognizes an A-T 
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base pair or a T-A base pair; and P~P, Y~Yr or the like 
recognizes an A-T base pair or a T-A base pair. Furthermore, 
P~P, y-y, or the like not only recognizes the base pair but 
also constitutes the trifurcate portion of a bent hairpin 
model. As has been generally known, in rare cases, a 
mismatch occurs depending on sequences around a target DNA 
sequence. For example, in an AT-rich sequence, Im-Py and 
the like adjacent to the y-turn structure in a hairpin-shape 
molecule often misrecognize . 

In an indole linker, which is the advantage of the 
compound of the present invention, two of Im, Py, and the 
like relative to one indole can form a pair. Two of Im, Py, 
and the like to form a pair are not necessarily needed. One 
of Im, Py, and the like may be used. Indole tends to 
recognize C or T similar to recognition by Py . Furthermore, 
the alkylating moiety of the indole derivative of the 
present invention alkylates the N-3 position of a purine 
base, in particular, the N-3 position of adenine. 

The alkylating moiety represented by R 1 in the above- 
described formula (1) is not particularly limited. Any 
moiety having the abilities to recognize and alkylate a 
specific base sequence in DNA may be used. Preferred 
examples of the alkylating moiety, i.e., R 1 , include 
moieties represented by the following formulae: 
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(2) 



(4) 



or 



N 



O 



\C0 2 CH 3 
*CH 3 



(5) 



wherein in each of the moieties represented by formulae (2), 
(3), (4), and (5), a substituent may be introduced, and the 
structure of each moiety may be simplified within the range 
in which the alkylation reaction is not inhibited. 

The moieties represented by formulae (2) and (5) 
described above are each CPI derived from duocarmycin A. 
Hereinafter, the moiety represented by formula (2) is 
referred to as "DU86". The moiety represented by formula 
(5) is referred to as "Du" . Furthermore, 1,2,9,9a- 
tetrahydrocyclopropa [c] benzo [e] indol-4-one represented by 
formula (3) is referred to as "CBI" . CBI represented by 
formula (3) includes optically isomeric moieties (3-S) and 
(3-R) represented by the following formulae: 
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(3-S) (3-R) 

R 3 in formula (4) represents a hydrogen atom, a peptide 
chain, a carbohydrate chain, or a polyethylene glycol group. 
Examples thereof include peptide chains, proteins, 
monosaccharides , di sac char ides , polysaccharides , and 
polyethylene glycols. E represents an easily detachable 
substituent, such as a halogen atom, a mesyl group, or a 
tosyl group. Examples of halogen include bromine, fluorine, 
and iodine. Preferably, halogen is chlorine. 

An example of the moiety represented by formula (4) is 
a moiety of l-chloromethyl-5-hydroxy-l , 2-dihydro-3H- 
benzo [e] indole (referred to as " seco-CBI " ) represented by 
formula ( 4-A) : 




OH 



(4-A) 



The moiety represented by formula (4-A) includes optically 
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isomeric moieties represented by formulae (4-A-S) and (4-A- 
R) : 





The moieties represented by formulae (3-S) and (3-R) 
can be prepared by treating the moieties represented by 
formulae (4-A-S) and (4-A-R) with a weak alkali as shown in 
the following reaction schemes: 
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The moiety of S-CBI represented by formula (3-S) 
includes the three-membered ring having the same 
configuration as that of the naturally existing Du moiety 
represented by formula (5). The indole derivative 
represented by formula (1) in which R 1 is represented by 
formula (3-S) has alkylating activity. In contrast, the 
indole derivative represented by formula (1) in which R 1 is 
the moiety of R-CBI represented by formula (3-R) has 
significantly weak alkylating activity and is thus inactive. 

R 2 in formula (1) represents a hydrogen atom, an alkyl 
group, or an acyl group. The alkyl group and the acyl group 
(portion excluding CO) usually has a carbon atom number of 1 
to 20 and preferably 1 to 10. A linear or branched lower 
alkyl group having 1 to 6 carbon atoms is more preferred. 
Examples thereof include a methyl group, an ethyl group, a 
n-propyl group, an isopropyl group, a n-butyl group, an 
isobutyl group, a tert-butyl group, a pentyl group, and a 
hexyl group. The indole derivative represented by formula 
(1) in which R 2 is an acetyl group can be suitably used. 
Furthermore, each of an alkyl group and an acyl group may 
have a substituent. 

Examples of the indole derivative represented by 
general formula (1) include compounds represented by the 
following formulae : 
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(wherein R 1 represents the moiety represented by formula 
(3); and R 2 represents an acyl group); and compounds 
represented by the following formulae: 




By taking compound (3-1) as an example, the outline of 
a method for synthesizing the indole derivative of the 
present invention will be described according to synthetic 
scheme 1. AcImlmCC^H (3-1-1) and amine compound (3-1-2) 
prepared from 5-nitroindole-2-carboxylic acid ethyl ester by 
catalytic reduction are dissolved in dimethylf ormamide (DMF) . 
0- ( 7-azabenzatr iazol-l-yl ) -1, 1, 3 , 3-tetramethyluronium 
hexaf luorophosphate (HATU) functioning as a condensation 
agent and diisopropylethylamine ( 1 Pr 2 NEt) are added to the 
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resulting solution to perform a condensation reaction into 
Py-Im indole ethyl ester (3-1-3). Compound (3-1-3) is 
subjected to alkaline hydrolysis to prepare carboxylic acid 
(3-1-4). Next, carboxylic acid (3-1-4) thus prepared is 
coupled to seco-CBI (3-1-5) that can be synthesized from 
commercially available 1, 3-naphthalenediol, thereby 
preparing open-circular compound (4-1) as a compound of the 
present invention. Furthermore, open-circular compound (4- 
1) is treated with a weak alkali (NaHCOs) in water to 
prepare circular compound (3-1) as a compound of the present 
invention . 

(Synthetic Scheme 1) 




By taking compound (3-2) as an example, the outline of 
a method for synthesizing the indole derivative of the 
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present invention will be described according to synthetic 
scheme 2. By applying Fmoc solid-phase synthesis to a Py-Im 
polyamide using an oxime resin, carboxylic acid (3-2-1) 
corresponding to the above-described hairpin Py-Im polyamide 
is prepared. After carboxylic acid (3-2-1) is converted 
into active At ester (3-2-2) using HATU in the reaction 
system, aminoindolecarboxylic acid (3-2-3) is added thereto 
to synthesize indolecarboxy lie acid (3-2-4) at a time. 
Compound (3-2-4) is coupled to seco-CBI (3-1-5) that can be 
prepared from commercially available 1 , 3-naphthalenediol , 
thereby preparing open-circular compound (4-2) as the indole 
derivative of the present invention. Furthermore, open- 
circular compound (4-2) is treated with a weak alkali 
(NaHC03) in water to prepare circular compound (3-2) as the 
indole derivative of the present invention. Circular 
compounds (3-3), (3-4), (3-5), and the like as indole 
derivatives of the present invention can be prepared in the 
same way. 

(Reaction Scheme 2) 
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A DNA-alkylat ing agent of the present invention is 
composed of the inventive indole derivative represented by 
general formula (1). 

The indole derivative of the present invention used as 
the alkylating agent may have a hairpin structure to 
recognize DNA. Alternatively, the indole derivative may 
dimerize to recognize DNA. 

In addition to the hairpin structure and the 
dimerization, DNA may be recognized by adding a compound 
having one divalent constitutional unit or having two or 
more constitutional units which may be the same or different, 
the constitutional unit being represented by the following 
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formula : 




(wherein n is an integer of 0 to 10), wherein among the 
constitutional units, a terminal constitutional unit 
adjacent to an N-terminus may be a constitutional unit 
represented by the following formula: 




(wherein q is an integer of 0 to 10). That is, a base pair 
may be recognized by pairing the compound with different 
indole derivative (1) of the present invention. 
Furthermore, a base pair may be recognized by pairing the 
compound with a known minor-groove binder. 

A drug of the present invention contains the alkylating 
agent of the present invention and thus suppresses or 
activates the expression of a gene. 

Examples of the gene suppressed or regulated include 
abnormal genes, single nucleotide polymorphisms (SNPs), and 
oncogenes. The inventive indole derivative used as an 
active component compound of the drug of the present 
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invention may be a pharmaceut ically acceptable salt thereof. 
Examples of the salt include commonly used salts, such as 
salts with acids, e.g., hydrochlorides, phosphates, citrates, 
and sulfonates, and salts with organic bases, e.g., 
methylamine and ethylenediamine . 

The indole derivative or the pharmaceut ically 
acceptable salt thereof has advantages described as follows: 

(a) the effect of the compound can be expected in a small 
amount because of high uptake by cancer cells; 

(b) the compound can be expected to be orally absorbed 
because the compound is stable in an acid or an alkali; 

(c) it is possible to control the expression of a specific 
gene; and 

(d) no damage to normal cells is caused compared with known 
DNA alkylating agents. 

The drug of the present invention is expected to have a 
therapeutic effect on various diseases by controlling the 
expression of a gene. In particular, the drug is useful for 
the treatment and prevention of cancer and can be used for 
deep cancer. When the drug is used as an anticancer agent, 
the drug is formed into a commonly used formulation, such as 
an injection, a tablet, a powder, or a capsule. In making 
the formulation, a commonly used, pharmaceut ically 
acceptable carrier, such as a binder, a lubricant, a 
disintegrator, a solvent, a dispersant, a stabilizer, a 
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pigment, or a flavor, may be used. The content of the 
indole derivative in the formulation is different depending 
on types of indole derivative and formulation and is usually 
0.2 to 50 percent by weight. 

The dosage of the inventive drug as an anticancer agent 
is different depending on age, weight, pathology, a 
therapeutic effect, an administration method, administration 
timing, dosing days, and the duration of drug administration. 
Usually, 10 to 400 mg of the drug is administered once a day 
for 1, 2, or 3 weeks and 1 week following drug withdrawal as 
one course of treatment. The administration is repeated. 
Furthermore, by changing the configuration of imidazole, 
pyrrole, the P-linker, and the like, the drug can be applied 
to various types of cancer. Moreover, the drug is used for 
cancer that causes further expression of abnormal genes. In 
particular, the drug is used for abnormality of the genes in 
cancer on which a known anticancer agent does not easily 
exert an effect. 
EXAMPLES 

While the present invention will be specifically 
described by examples, the present invention is not limited 
to these examples. 
SYNTHESIS EXAMPLE 1 and 2 

Synthesis of Compounds (4-1) and (3-1) (Scheme 1) 

(1) Synthesis of AcImImC0 2 H (3-1-1) 
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Compound (3-1-1) was synthesized according to a method 
described in literature [(a) Tao, Z. -F . ; Fujiwara, T . ; 
Saito, I.; Sugiyama, H. J. Am. Chem. Soc. 1999, 121, 4961. 
(b) Tao, Z. -F . ; Saito, I.; Sugiyama, H. J. Am. Chem. Soc. 
2000, 122, 1602. (c) Bando, T . ; Narita, A.; Saito, I.; 
Sugiyama, H. Chem. Eur. J. 2002, 8, 4781]. 

1 H-NMR (50 0 MHz, DMSO-d 6 ) 8: 10.37 (s, 1H; NH), 9.60 (s, 1H; 
NH), 7.63 (s, 1H; CH), 7.48 (s, 1H; CH), 3.96 (s, 3H; NCH 3 ), 
3.93 (s, 3H; NCH 3 ) , 2.03 (s, 3H; COCH 3 ); 
ESIMS m/e calcd. for Ci 2 H 15 N 6 0 4 
[M + + H] 307.1, found 307.3 

(2) Synthesis of H 2 N-Indole-C0 2 Et (3-1-2) 

Compound (3-1-2) was synthesized from commercially 
available 5-nitroindole-2-carboxylic acid ethyl ester as a 
starting material by catalytic reduction with Pd-C in a 
hydrogen atmosphere. Compound (3-1-2) was used as a 
starting material for synthesizing compound (3-1-3) without 
purification . 

(3) Synthesis of AcImIm-Indole-C0 2 Et (3-1-3) 

Compound (3-1-1) (305 mg, 1.05 mmol) and compound (3-1- 
2) (215 mg, 1.05 mmol) were dissolved in DMF (2 mL). Then, 
i Pr 2 NEt (550 juL, 3.15 mmol) and HATU (480 mg, 1.26 mmol) were 
added to the resulting solution. The resulting mixture was 
stirred at room temperature for 15 hours in a nitrogen 
atmosphere. After completion of the reaction, the solvent 
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in the reaction solution was distilled off under reduced 
pressure. The resulting residue was purified by silica-gel 
column chromatography (3% to 5% MeOH in CH2CI2, gradient 
elution) , followed by removing the solvent to yield a yellow 
powder (3-1-3) (326 mg, 52%) . 

1 H-NMR (500 MHz, DMSO-d 6 ) 8: 11.83 (s, 1H; NH) , 10.32 (s, 1H; 
NH), 10.08 (s, 1H; NH) , 9.40 (s, 1H; NH), 8.16 (s, 1H; CH), 
7.60 (s, 1H; CH), 7.56 (d, 1H, J = 9.0 Hz; CH), 7.50 (s, 1H; 
CH), 7.40 (d, 1H, J = 9.0 Hz; CH) , 7.12 (s, 1H; CH) , 4.33 (q 
2H, J = 7.0 Hz; OCH 2 ), 4.01 (s, 3H; NCH 3 ), 3.98 (s, 3H; NCH 3 ) 
2.03 (s, 3H; COCH 3 ), 1.33 (t, 3H, J = 7.0 Hz; CH 3 ); 
ESIMS m/e calcd. for C23H25N8O5 
[M + + H] 493.2, found 493.2 

(4) Synthesis of Aclmlm-Indole-C0 2 H (3-1-4) 

MeOH (8 mL) and 1 N aqueous sodium hydroxide solution 

(8 mL) were added to compound (3-1-3) (326 mg, 0.66 mmol). 
The resulting mixture was stirred at room temperature for 30 
minutes. After removal of MeOH under reduced pressure, 10% 
aqueous HC1 solution was added thereto at 0°C to make the 
mixture acidic (pH = 2). The resulting precipitate was 
collected by filtration with a KIRIYAMA funnel, washed with 
water, and dried to yield compound (3-1-4) (297 mg, 96%). 
^"H-NMR (500 MHz, DMSO-d 6 ) 8: 11.71 (s, 1H; NH), 10.35 (s, 1H; 
NH), 10.08 (s, 1H; NH) , 9.49 (s, 1H; NH) , 8.13 (d, 1H, J = 
1.0 Hz; CH), 7.60 (s, 1H; CH), 7.54 (dd, 1H, J = 2.0 and 9.0 



Hz; CH), 7.49 (s, 1H; CH), 7.38 (d, 1H, J = 9.0 Hz; CH) , 
7.06 (d, 1H, J = 2.0 Hz; CH), 4.01 (s, 3H; NCH 3 ), 3.97 (s, 
3H; NCH 3 ), 2.03 (s, 3H; COCH 3 ); 
ESIMS m/e calcd. for C 2 iH 21 N 8 0 5 
[M + + H] 465.2, found 465.2 

(5) Synthesis of seco-CBI (3-1-5) 

Compound (3-1-5) was synthesized as a hydrochloride 
according to a method described in literature [(a) Boger, D. 
L . ; Yun, W. Y.; Teegarden, B. R. J. Org. Chem. 1992, 57, 
2873. (b) Boger, D. L . ; McKie, J. A. J. Org. Chem. 1995, 60, 
1271. (c) Boger, D. L . ; Ishizaki, T.; Kitos, P. A.; 
Suntornwat, 0. J. Org. Chem. 1990, 55, 5823]. Spectrum data 
were reported in the literature. 

(6) Synthesis of AcImlm-Indole-seco-CBI (4-1) 

seco-CBI (3-1-5) (32.7 mg, 0.13 mmol), EDCI (50.3 mg, 
0.26 mmol), and NaHC0 3 (41.9 mg, 0.52 mmol) were charged 
into a reaction vessel containing compound (3-1-4) (60.3 mg, 
0.13 mmol). The resulting mixture was dissolved in DMF (700 
jiL) and stirred at room temperature for 8 hours in a 
nitrogen atmosphere. After confirmation of completion of 
the reaction, the solvent was distilled off. The resulting 
residue was purified by silica-gel column chromatography (5% 
to 10% MeOH in CH 2 C1 2 , gradient elution) . The resulting 
solid was washed with CHC1 3 and dried to yield a brown 
powder (4-1) (85.2 mg, 96%). 
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1 H-NMR (500 MHz, DMSO-d 6 ) 8: 11.71 (s, 1H; NH ) , 10.43 (s, 1H; 
OH), 10.34 (s, 1H; NH ) , 10.11 (s, 1H; NH) , 9.41 (s, 1H; NH), 
8.16 (s, 1H; CH), 8.12 (d, 1H, J = 8.0 Hz; CH) , 7.97 (brs, 
1H; CH), 7.85 (d, 1H, J = 8.0 Hz; CH), 7.62 (m, 1H; CH), 
7.61 (s, 1H; Im-H), 7.52 (t, 1H, J = 8.0 Hz; CH) , 7.51 (s, 
1H; Im-H), 7.45 (d, 1H, J = 9.0 Hz; CH), 7.36 (t, 1H, J = 
8.0 Hz; CH), 7.19 (s, 1H; CH), 4.81 (t, 1H, J = 11.0 Hz; 
NCHH), 4.56 (d, 1H, J = 11.0 Hz; NCHH) , 4.23 (brt, 1H; CH), 
4.03 (s, 3H; NCH 3 ), 3.99 (s, 3H; NCH 3 ), 3.87 (dd, 2H, J = 7.0 
and 11.0 Hz; CH 2 ) , 2.04 (s, 3H; COCH 3 ) ; 
ESI-TOFMS m/e calcd. for C34H31CIN9O5 

[M + + H] 680.22, found 680.23 

(7) Synthesis of Aclmlm-Indole-CBI (3-1) 

First, 5% aqueous NaHC0 3 (100 jxL) was added to a 
solution of compound (4-1) (6.3 mg, 9.28 mmol) in DMF (150 
jiL) . The resulting mixture was stirred at room temperature 
for 1 hour in a nitrogen atmosphere. After confirmation of 
completion of the reaction, the solvent was distilled off. 
The resulting residue was purified by silica-gel column 
chromatography (5% to 15% MeOH in CH2CI2, gradient elution) . 
The solvent was distilled off. The resulting residue was 
dried to yield a brown powder (3-1) (6.0 mg, quant.). 
1 H-NMR (500 MHz, DMSO-d 6 ) 8: 11.85 (s, 1H; NH), 10.35 (s, 1H; 
NH), 10.16 (s, 1H; NH) , 9.41 (s, 1H; NH), 8.17 (s, 1H; CH), 
8.00 (d, 1H, J = 8.0 Hz; CH), 7.94 (s, 1H; CH), 7.62 (s, 1H; 
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Im-H), 7.52 (m, 2H; CHx2), 7.51 (s, 1H; Im-H) , 7.43 (m, 2H; 
CH), 7.25 (d, 1H, J = 8.0 Hz; CH) , 6.95 (s, 1H; CH), 4.64 (m, 
1H; NCHH ) , 4.49 (d, 1H, J = 10.0 Hz; NCHH) , 4.02 (s, 3H; 
NCH 3 ), 3.98 (s, 3H; NCH 3 ), 3.27 (brt, 1H; CH), 2.03 (s, 3H; 
COCH3) , 1.75 (m, 1H; CHH) , 1.70 (m, 1H; CHH); 
ESI-TOFMS m/e calcd. for C34H30N9O5 
[M + + H] 644.23, found 644.21 
SYNTHESIS EXAMPLE 3 

Synthetic Reaction of Compound (3-2) (Scheme 2) 

(1) Synthesis of AcImImPyPy-y-ImPyC0 2 H (3-2-1) 

Compound (3-2-1) was prepared by an Fmoc solid-phase 
synthesizer. Purification was performed by HPLC with a 
Chemcobond 5-ODS-H column (conditions: linear gradient of 0- 
50% acetonitrile in 0.1% acetic acid, 40 min, 254 nm) . The 
resulting compound was used as a starting material for 
synthesis . 

1 H-NMR (500 MHz, DMSO-d 6 ) 8: 10.32 (s, 1H; NH ) , 10.29 (s, 1H; 

NH), 10.23 (s, 1H; NH) , 10.00 (s, 1H; NH) , 9.91 (s, 1H; NH) , 

9.32 (s, 1H; NH), 8.02 (s, 1H; NH) , 7.56 (s, 1H; Im-H), 7.50 
(s, 1H; Im-H), 7.45 (s, 1H; Py-H) , 7.44 (s, 1H; Im-H), 7.27 
(s, 1H; Py-H), 7.16 (s, 1H; Py-H), 7.14 (s, 1H; Py-H), 6.92 
(s, 1H; Py-H), 6.89 (s, 1H; Py-H), 4.00 (s, 3H; NCH 3 ) , 3.97 
(s, 3H; NCH 3 ) , 3.93 (s, 3H; NCH 3 ), 3.84 (s, 3H; NCH 3 ) , 3.81 
(s, 3H; NCH 3 ), 3.79 (s, 3H; NCH 3 ) , 3.19 (m, 2H; CH 2 ), 2.34 (m, 

2H; CH 2 ), 2.03 (s, 3H; COCH3) , 1.78 (m, 2H; CH 2 ); 



27 



ESI-TOFMS m/e calcd. for C39H45N16O9 
[M + + H] 881.35, found 881.36 

(2) Synthesis of H 2 N-Indole-C0 2 H (3-2-3) 

Compound (3-2-3) was synthesized from commercially 
available 5-nitroindole-2-carboxylic acid ethyl ester as a 
starting material by a two-step reaction ((i) alkali 
hydrolysis with 1 N NaOH; and (ii) catalytic reduction with 
Pd-C in a hydrogen atmosphere) . 

1 H-NMR (500 MHz, DMSO-d 6 ) 8: 11.22 (s, 1H; NH), 7.11 (d, 1H, 
J = 8.5 Hz; CH), 6.75 (d, 1H, J = 2.0 Hz; CH) , 6.67 (s, 1H; 
CH), 6.65 (dd, 1H, J = 2.0 and 8.5 Hz; CH) , 3.31 (brs, 2H; 
NH 2 , H 2 0) 

(3) Synthesis of AcImImPyPy-y-ImPy-Indole-C0 2 H (3-2-4) 

Compound (3-2-1) (3.5 mg, 3.98 mmol) prepared by a 
solid-phase synthesis was dissolved in DMF (75 jliL). Then, 
iPr 2 NEt (1.4 \xL, 8.0 4 mmol) and HATU (1.4 mg, 3.6 8 mmol) 
were added to the resulting solution. The resulting mixture 
was stirred at room temperature for 4 hours in a nitrogen 
atmosphere. After confirmation of completion of the 
reaction, compound (3-2-3) (1.3 mg, 7.38 mmol) and x Pr 2 NEt 
(1.3 jiL, 7.46 mmol) were added thereto. The resulting 
mixture was stirred overnight at room temperature in a 
nitrogen atmosphere. After the reaction, the solvent in the 
reaction solution was distilled off. The residue was 
filtrated using a KIRIYAMA funnel and washed with CH 2 C1 2 and 
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H 2 0. As a result, crude crystals of compound (3-2-4) were 
produced (3.0 mg, 73%). 

1 H-NMR (500 MHz, DMSO-d 6 ) 8: 11.33 (brs, 1H; NH) , 10.34 (s, 
1H; NH), 10.33 (s, 1H; NH) , 10.27 (s, 1H; NH) , 9.92 (s, 2H; 
NH), 9.73 (s, 1H; NH), 9.32 (brs, 1H; NH) , 8.01 (brt, 1H; 
NH), 7.94 (s, 1H; CH), 7.56 (s, 1H; Im-H) , 7.50 (s, 1H; Im- 
H), 7.46 (s, 1H; Im-H), 7.40 (brd, 1H, J = 8.5 Hz; CH), 7.31 
(brd, 1H, J = 8.5 Hz; CH) , 7.30 (d, 1H, J = 1.5 Hz; Py-H), 
7.26 (d, 1H, J = 1.5 Hz; Py-H), 7.17 (d, 1H, J = 1.5 Hz; Py- 
H), 7.16 (s, 2H; y-Hx2), 6.90 (d, 1H, J = 1.5 Hz; Py-H), 
6.85 (brs, 1H; CH) , 4.00 (s, 3H; NCH 3 ), 3.97 (s, 3H; NCH 3 ), 
3.95 (s, 3H; NCH 3 ) , 3.85 (s, 3H; NCH 3 ), 3.84 (s, 3H; NCH 3 ), 
3.80 (s, 3H; NCH 3 ) , 3.20 (dt, 2H, J = 6.0 and 7.5 Hz; CH 2 ), 
2.36 (t, 2H, J = 7.5 Hz; CH 2 ), 2.04 (s, 3H; COCH 3 ), 1.79 (qu, 
2H, J = 7 . 5 Hz; CH 2 ) ; 

ESI-TOFMS m/e calcd. for C 4 8H 5 iNi 8 Oio 
[M + + H] 1039.40, found 1039.39 

(4) Synthesis of AcImlmPyPy-y-ImPy-Indole-CBI (3-2) 

Into a reaction vessel containing the crude crystals of 
compound (3-2-4) (3.0 mg, 2.89 mmol), seco-CBI (3-1-5) (1.4 
mg, 6.01 mmol), EDCI (1.2 mg, 6.25 mmol), and NaHC0 3 (1.0 mg, 
12.0 mmol) were charged. The resulting mixture was 
dissolved in DMF (100 jj,L), and the resulting solution was 
stirred at room temperature for 2 hours in a nitrogen 
atmosphere. After confirmation of completion of the 
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reaction, 5% NaHC0 3 (300 jiL) and DMF (300 juL) were added 
thereto. The resulting mixture was stirred for 30 minutes. 
After completion of the reaction, the solvent was distilled 
off. The resulting residue was purified by silica-gel 
column chromatography (5% to 15% MeOH in CH2CI2, gradient 
elution) . The solvent was distilled off to yield crude 
crystals (3-2) (1.33 mg, 27% for 2 steps). Furthermore, 
purification was performed by HPLC (linear gradient of 0-50% 
acetonitrile in 0.1% acetic acid, 40 min, 254 nm) . The 
resulting solution was subjected to vacuum concentration and 
lyophilization to yield yellow crystals (3-2) (0.5 mg, 0.41 
jimol; 11% for 2 steps). 

1 H-NMR (500 MHz, DMSO-d 6 ) 8: 11.77 (s, 1H; NH), 10.32 (s, 1H; 
NH), 10.29 (s, 1H; NH) , 10.26 (s, 1H; NH) , 9.95 (s, 1H; NH) , 
9.92 (s, 1H; NH), 9.82 (s, 1H; NH) , 9.33 (s, 1H; NH) , 8.08 

(s, 1H; CH), 8.01 (brs, 1H; NH), 8.00 (d, J = 7.5 Hz, 1H; 
CH), 7.60 (t, J = 7.5 Hz, 1H; CH), 7.56 (s, 1H; CH), 7.53 (d, 
J = 8.5 Hz, 1H; CH) , 7.50 (s, 1H; CH) , 7.46 (s, 1H; CH), 
7.43 (t, J = 7.5 Hz, 1H; CH), 7.42 (s, 1H; CH), 7.31 (s, 1H; 
CH), 7.27 (s, 1H; CH), 7.24 (d, J = 8.5 Hz, 1H; CH), 7.22 (s, 
1H; CH), 7.19 (s, 1H; CH), 7.16 (d, J = 7.5 Hz, 1H; CH), 
7.06 (s, 1H; CH), 6.95 (s, 1H; CH) , 6.89 (s, 1H; CH) , 4.62 

(dd, J = 10.0 and 5.0 Hz, 1H; NCH 2 ) , 4.48 (d, J = 10.0 Hz, 
1H; NCH 2 ), 4.00 (s, 3H; NCH 3 ) , 3.97 (s, 3H; NCH 3 ) , 3.95 (s, 
3H; NCH3) , 3.86 (s, 3H; NCH 3 ) , 3.84 (s, 3H; NCH 3 ) , 3.80 (s, 
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3H; NCH 3 ), 3.20 (m, 2H; CH 2 ), 2.90 (m, 1H; CH), 2.35 (m, 2H; 
CH 2 ), 2.03 (s, 3H; COCH 3 ), 1.79 (m, 2H; CH 2 ), 1.76 (dd, J = 
7.5 and 5.0 Hz, 1H; CH) , 1.70 (t, J = 5.0 Hz, 1H; CH) ; 

ESI-TOFMS m/e calcd. for C 6 iH 60 N 19 Oi 0 
[M + + H] 1218.48, found 1218.48 

To synthesize AcImlmPyPy-y-ImPy-Indole-seco-CBI (4-2), after 
completion of the reaction, the solvent is distilled off 
without treatment with 5% NaHC0 3 . The resulting residue is 
purified by silica-gel column chromatography (5% to 15% MeOH 
in CH 2 C1 2 , gradient elution) and HPLC (linear gradient of 0- 
100% acetonitrile in 0.1% acetic acid, 20 min, 254 nm) . The 
resulting solution is subjected to vacuum condensation and 
lyophilization to yield yellow crystals (4-2). 
ESI-TOFMS m/e calcd. for C 6 iH 6 iC1Ni 9 Oio 
[M + + H] 1254.45, found 1254.50 
Synthesis Example 4 

Synthesis of AcImlmPy-y-Im-Indole-CBI (3-3) 

Compound (3-3) was synthesized in the same synthetic 
procedure as that in Synthesis Example 3. Purification was 
performed by HPLC with a Chemcobond 5-ODS-H column 
(conditions: linear gradient of 0-50% acetonitrile in 0.1% 
acetic acid, 40 min, 254 nm) . The resulting compound was 
used for a DNA-alkylat ing reaction. 
ESI-TOFMS m/e calcd. for C49H48N15O8 
[M + + H] 974.37, found 974.26 
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Synthesis Example 5 

Synthesis of AcImlmPyPy-y-PyPy-Indole-CBI (3-4) 

Compound (3-4) was synthesized in the same synthetic 
procedure as that in Synthesis Example 3. Purification was 
performed by HPLC with a Chemcobond 5-ODS-H column 
(conditions: linear gradient of 0-50% acetonitrile in 0.1% 
acetic acid, 40 min, 254 nm) . The resulting compound was 
used for a DNA-alkylat ing reaction. 
ESI MS m/e calcd. for C 6 2H 6 iNi 8 Oio 
[M + + H] 1217.5, found 1217.4 
Synthesis Example 6 

Synthesis of AcImPyPyPy-y-ImPy-Indole-CBI (3-5) 

Compound (3-5) was synthesized in the same synthetic 
procedure as that in Synthesis Example 3. Purification was 
performed by HPLC with a Chemcobond 5-ODS-H column 
(conditions: linear gradient of 0-50% acetonitrile in 0.1% 
acetic acid, 40 min, 254 nm) . The resulting compound was 
used for a DNA-alkylat ing reaction. 
ESI-TOFMS m/e calcd. for C 62 H 61 N 18 Oi 0 
[M + + H] 1217.5, found 1217.4 
Synthesis Example 7 

Synthesis of AcImlmPyPypPyPy-y-ImPyPlmPy-Indole-CBI (3-6) 

Compound (3-6) was synthesized in the same synthetic 
procedure as that in Synthesis Example 3. Purification was 
performed by HPLC with a Chemcobond 5-ODS-H column 
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(conditions: linear gradient of 0-50% acetonitrile in 0.1% 
acetic acid, 40 min, 254 nm) . The resulting compound was 
used for a DNA-alkylat ing reaction. 
ESI MS m/e calcd. for C90H93N30O16 

[M + + H] 1849.7, found 1850.1 
Synthesis Example 8 

Synthesis of AcImlmPy-p-ImPy-Indole-CBI (3-7) 

Compound (3-7) was synthesized in the same synthetic 
procedure as that in Synthesis Example 3. Purification was 
performed by HPLC with a Chemcobond 5-ODS-H column 
(conditions: linear gradient of 0-50% acetonitrile in 0.1% 
acetic acid, 40 min, 254 nm) . The resulting compound was 
used for a DNA-alkylat ing reaction. 
ESI-TOFMS m/e calcd. for C54H52N17O9 
[M + + H] 1082.5, found 1082.4 
Synthesis Example 9 

Synthesis of AcImlmlm-y-PyPy-Indole-seco-CBI (4-3) 

Compound (4-3) was synthesized by applying solution- 
phase synthesis of an already reported Py-Im polyamide. 
Purification was performed by HPLC with a Chemcobond 5-ODS-H 
column (conditions: linear gradient of 0-100% acetonitrile 
in 0.1% acetic acid, 20 min, 254 nm) . The resulting 
compound was used for a DNA-alkylat ing reaction. 
ESI-TOFMS m/e calcd. for C55H 5 5ClNi 7 0 9 
[M + + H] 1132.5, found 1132.4 
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Synthesis Example 10 

Synthesis of AcImlmPyPy-p-PyPyPy-p-Dp (6) 




Compound (6) was synthesized with an fmoc solid-phase 
synthesizer and separated from a solid-phase support by 
heating in the presence of dimethy laminopropy lamine . 
Purification was performed by HPLC with a Chemcobond 5-ODS-H 
column (conditions: linear gradient of 0-100% acetonitrile 
in 0.1% acetic acid, 20 min, 254 nm) . 
ESI-MS m/e calcd. for C53H67N20O10 
[M + + H] 1143.9, found 1143.5 

In each of Synthesis Examples 1 to 10 described above, 
reagents, such as N, N-diisopropylethy lamine ( 1 Pr 2 NEt) and 0- 
( 7-azabenzotr iazole-l-yl ) -1 , 1 , 3 , 3-tetramethyluronium (HATU) , 
solvents, such as N, N-dimethylf ormamide (DMF) , and the like 
were mainly purchased from a reagent manufacturer, such as 
Sigma-Aldr ich Co., and used without purification. Unless 
otherwise specified, the reaction was monitored by high- 
performance liquid chromatography (HPLC) at a UV wavelength 
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of 254nm UV. The 1 H-NMR spectrum was measured with JEOL 
JNM-A 500 (500 MHz) using tetramethylsilane (TMS) as an 
internal standard. With respect to multiplicity, a singlet, 
a doublet, a triplet, a quartet, a quintet, a multiplet, and 
broad were abbreviated as s, d, t, q, qu, m, and br, 
respectively. Electrospray ionization mass spectra (ESIMS) 
were measured with a PE SCIEX API 165. Electrospray 
ionization time-of -flight mass spectrometry (ESI-TOFMS) was 
performed on a BioTOFII (Bruker Daltonics) mass spectrometer. 
Polyacrylamide gel electrophoresis was performed on a 
HITACHI 5500-S DNA Sequencer. The loading dye 
(dimethylf ormamide with fushin red) was purchased from 
Amersham Co. Ltd., and 50% Long Ranger™ gel solution was 
purchased from FMC Bioproducts . Calf intestine alkaline 
phosphatase (AP, 1,000 unit/mL) was purchased from 
Boehr inger Mannheim . 

Synthesis of 5 1 -Texas Red-Labeled DNA Fragment 

A 5' -Texas Red-labeled DNA fragment (450 bp) (pUC-I', 
SEQ ID NO : 1 ) was prepared by a PCR method using a 5 1 -Texas 
Red-labeled 20-base-pair primer, i.e., 5 1 -TexRed- 
AGAATCAGGGGATAACGCAG-3 1 (pUC18 forward, 780-799; SEQ ID 
NO : 2 ) , and a 20-base-pair primer, i.e., 5'- 

TTACCAGTGGCTGCTGCCAG-3 1 (pUC18 reverse, 1459-1478; SEQ ID 
NO : 3 ) , pucl8 being used as a template. The DNA fragment was 
in a double-stranded state including a complementary strand 
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(the following fragments were in the same state) . 

A 5' -Texas Red-labeled DNA fragment (450 bp) (pUC-II, 
SEQ ID NO: 4) was prepared by a PCR method using a 5 1 -Texas 
Red-labeled 21-base-pair primer, i.e., 5 1 -TexRed- 
TGCTGGCCTTTTGCTCACATG-3 1 (pUC18 reverse, 1861-1881; SEQ ID 
NO:5), and a 19-base-pair primer, i.e., 5- 

TGTAAAACGACGGCCAGTG-3 1 (pUC18 forward, 328-345; SEQ ID NO : 6 ) , 
pucl8 being used as a template. 

A 5' -Texas Red-labeled DNA fragment (450 bp) (pUC-II 1 , 
SEQ ID NO: 7) was prepared by a PCR method using the 5 1 -Texas 
Red-labeled 19-base-pair primer (pUC18 forward, 328-345; SEQ 
ID NO : 6 ) and the 21-base-pair primer (pUC18 reverse, 1861- 
1881; SEQ ID NO : 5 ) , pucl8 being used as a template. 

A 5' -Texas Red-labeled DNA fragment (537 bp) (A,-F10906; 
SEQ ID NO : 8 ) was prepared by a PCR method using a 5 1 - Texas 
Red-labeled 23-base-pair primer, i.e., 5 ' -TexRed- 
ATCAGGGCAACTCAACCCTGTCC-3 1 (X-DNA forward, 10960-10982; SEQ 
ID NO : 9 ) , and a 20-base-pair primer, i.e., 5 1 - 

CAGGACGACCAATATCCAGC-3 1 (X-DNA reverse, 3 700 7-3 7026; SEQ ID 
NO: 10), X,— DNA being used as a template. 

A 5' -Texas Red-labeled DNA fragment (994 bp) (pQBl63; 
SEQ ID NO: 11) was prepared by a PCR method using a 20-base- 
pair primer, i.e., 5 1 -GGTGATGTCGGCGATATAGG-3 1 (SEQ ID NO: 12), 
and a 5 1 -Texas Red-labeled 20-base-pair primer, i.e., 5 1 - 
TexRed-CCCCAAGGGGTTATGCTAGT-3 1 (SEQ ID NO: 13), pQB!63 
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plasmid being used as a template. 

A 5' -Texas Red-labeled 727-bp DNA fragment (SEQ ID 
NO: 14) was prepared by a PCR method using a 5 1 -Texas Red- 
labeled 20-base-pair primer, i.e., 5 1 -TexRed- 

CCCATTCTAAACTGTACCCT-3 1 (SEQ ID NO: 15), and a 21-base-pair 
primer, i.e., 5 1 -GGCATCAAGGAAGGTGATTGG-3 1 (SEQ ID NO: 16), a 
plasmid containing MLP into which the code base sequence of 
human p-globin was integrated being used as a template. 

A 5' -Texas Red-labeled 446-bp DNA fragment (SEQ ID 
NO: 17 with a 2 4-t imes-repeated telomere sequence was 
prepared by a PCR method using a 5 1 -Texas Red-labeled 20- 
base pair primer, i.e., 5 1 -TexRed-GGCCAGTGAATTGTAATACG-3 1 
(SEQ ID NO:18), and a 20-base-pair primer, i.e., 5 ! - 
CCAGGCTTTACACTTTATGC-3 1 (SEQ ID NO : 1 9 ) , pCR2 . 1 plasmid into 
which an oligomer having a repetitive telomere sequence was 
integrated from the cleavage site by EcoRI being used as a 
template. The resulting each DNA fragment was purified by 
filtration using Suprec-02. The concentration of each 
resulting product was determined by measuring UV absorption. 
Ability to Alkylate Linear DNA (400 bp or more) 

The reaction of compounds (3-1) and (4-1) prepared in 
the above-described Synthesis Examples with DNA was studied 
using the long-chain DNA (pUC-II 1 ; SEQ ID NO : 7 ) . An 
alkylation reaction was performed at 23°C for 8 hours. The 
sequence alkylated was analyzed by polyacrylic gel 
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electrophoresis for sequence determination. Fig. 1 shows 
the results. Open-circular compound (4-1) and circular 
compound (3-1) had the same sequence specificity and was 
selectively alkylated adenine located at site 1 (5 ! -CGGCCA- 
3 ! ) at nanomolar concentrations. This sequence specificity 
can be explained by a model in which each compound dimerizes 
in the DNA minor groove to recognize the base sequence. 

Analysis by polyacry lamide gel electrophoresis was 
performed by the following process: A standard reaction 
solution containing the 5 1 -Texas Red-end-labeled DNA 
fragment (10 nM) , DMF (10% (v/v) ) , and an agent having a 
concentration described above in a sodium phosphate buffer 
(5 mM, total volume: 10 jiL, pH = 7.0) was charged into a 
microfuge tube (Eppendorf) and left standing at 23°C for 8 
hours. After completion of the reaction, the reaction was 
quenched by the addition of calf thymus DNA (1 mM, 1 \xL) . 
The resulting mixture was shaken at 90°C for 5 minutes. The 
DNA obtained by centr if ugat ion under reduced pressure was 
dissolved by the addition of 8 jaL of loading dye (fushin red 
solution in DMF), shaken at 94°C for 20 minutes, and then 
immediately cooled to 0°C. A 2-juL aliquot was separated 
electrophoret ically on a 6% denaturing polyacrylamide gel 
using a Hitachi 5500-S DNA sequencer. The following tests 
were conducted in a similar way. 

The reaction of compounds (3-2) and (3-3) prepared in 
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the above-described Synthesis Examples with DNA was studied 
using the long-chain DNA (pUC-II; SEQ ID NO: 4, pUC-I 1 ; SEQ 
ID NO : 1 ) . An alkylation reaction was performed for 8 hours. 
The sequence alkylated was analyzed by polyacrylic gel 
electrophoresis for sequence determination. Fig. 2 shows 
the results. As a result, in both compounds (3-2) and (3-3), 
efficient DNA alkylation comparable with that of a known Py- 
Im polyamide having a vinyl linker was observed at nanomolar 
concentrations. However, the difference in sequence 
recognition ability was observed. In compound (3-3), 
alkylation of site 4 ( 5 1 -AGCCA-3 1 ) , which was a match 
sequence, was observed. On the other hand, single-base-pair 
mismatch sequences in site 1 ( 5 1 -AGCTA-3 1 ) , site 2 (5 ! - 
AGTCA-3 1 ) , site 5 ( 5 ' -TACCA-3 1 ) , and the like were also 
observed. In contrast, in compound (3-2), alkylation in 
only site 3 ( 5 1 -TGACCA-3 1 ) and site 6 ( 5 1 -AGTCCA-3 1 ) , which 
were match sequences, was observed. From the stand point of 
the sequence recognition ability, the molecular design of 
compound (3-2) was excellent. Interestingly, although pUC- 
I' (SEQ ID N0:1) contained a site ( 5 ' -TGCCG-3 ' ) 
corresponding to a match sequence of compound (3-3), no 
alkylation of guanine was observed. 

In consideration of the results shown in Fig. 2, it is 
believed that the combination of the indole linker and CBI 
alkylates adenine at the N-3 position with highly sequence 
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specificity. In sequence recognition, it is believed that 
the indole linker adjusts the distance between the 
cyclopropane ring of CBI and the N-3 position of adenine so 
as to cause alkylation while functioning as like Py (Fig. 3). 

The reaction of compounds (3-2), (3-4), and (3-5) with 
DNA was studied using the long-chain DNA (?l-F10906; SEQ ID 
NO : 8 ) . The DNA-alkylat ing ability of each compound was 
evaluated. Fig. 4 shows the results. Sequence-specific DNA 
alkylation (site 2 and 3: 5 ' -TGACCA-3 ' , site 4: 5 ' -AGACCA-3 ' , 
site 5: 5 1 — TGTCCA— 3 T ) by compound (3-2) was observed at 
nanomolar concentrations as in the results shown in Fig. 2. 
Furthermore, Sequence-specific DNA alkylation (site 6: 5 f - 
AATCCA-3 ' , site 7: 5 1 — TAACCA— 3 T ) by compound (3-4) was also 
observed at nanomolar concentrations. In compound (3-5), 
alkylation of an only mismatch sequence (site 1: 5 1 -TGCTCA- 
3 ! ) was observed. These results indicate the possibility 
that in the molecular design of the hairpin Py-Im polyamide 
including the indole linker, alkylating agents that are 
capable of recognizing intended sequences can be produced by 
only changing the configuration of Py-Im. 

Establishment of the synthetic pathway using the indole 
linker made it possible to synthesize an alkylating agent, 
which cannot be synthesized using a known vinyl linker, 
having the ability to recognize a longer base sequence. For 
example, a Py-Im carboxylic acid having the ability to 
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recognize a long base sequence was prepared by Fmoc solid- 
phase synthesis for a Py-Im polyamide using an oxime resin, 
thus making it possible to synthesize compound (3-6) shown 
in Synthesis Example 7. Alkylation of the DNA fragment (SEQ 
ID NO: 11) derived from pQBl63 was studied. The results 
demonstrated that compound (3-6) efficiently alkylated 
1 , 0 0 O-base-pair DNA at nanomolar concentrations (Fig. 5). 
Furthermore, alkylation (site 2) due to 1-bp-mismatch 
recognition was observed. However, alkylation of AT-rich 
base sequences (sites 1 and 3) was also observed. In the 
Py-Im polyamide, such as compound (3-6), having the ability 
to recognize a long base sequence, the maintenance of high 
specificity is an important issue. It is believed that 
further optimization of the current molecular design makes 
it possible to produce a practical alkylating agent having 
the ability to recognize a sequence having 10 base pair or 
more with excellent versatility. 
Stability of Hairpin Polyamide 

Analysis of stability under acidic and alkaline 
conditions by HPLC was performed by the following process: A 
reaction solution containing compound (3-1) or (4-1) (100 
jiM) in a total volume of 10 |uL of an aqueous solution of 5% 
HC1 (pH = 1, DMF:H 2 0 =1:9) or an aqueous solution of 5% 
NaHCO.3 (pH = 9, DMF:H 2 0 = 1:9) was charged into a microfuge 
tube (Eppendorf) and left standing at 37°C. After 30 
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minutes, 2 hours, and 24 hours, analysis of compound (4-1) 
(16.8 min) and compound (3-1) (15.1 min) was performed by 
HPLC (linear gradient of acetonitrile 0-100% in 50 mM 
ammonium formate, 20 min, flow rate: 1.0 mL/min, 254 nm) . 

The Py-Im polyamide containing the indole linker was 
evaluated for stability by product analysis on HPLC using 
compounds (4-1) and (3-1). Fig. 6 shows the results. It 
was found that compound (4-1) was not hydrolyzed at all in 
the acidic aqueous HC1 solution (pH = 1) (left standing at 
37°C for 24 hours). Compound (4-1) was converted into 
compound (3-1) (30 minutes: 50%, 2 hours: 78%) in the 
alkaline aqueous NaHC0 3 solution (pH = 9) (left standing at 
37°C for 24 hours). No other hydrolysates were formed. 
Compound (3-1) was readily converted into compound (4-1) (30 
minutes: 96%) in the acidic aqueous HC1 solution (pH = 1) 
(37°C). That is, it was found that the Py-Im polyamide 
containing the indole linker was stably present in a seco- 
CBI form like compound (4-1) under the acidic condition (pH 
= 1) and in a CBI form like compound (3-1) under the 
alkaline condition (pH = 9). 

Sequence-Specific Alkylation of DNA by Polyamide Heterodimer 

A Py-Im carboxylic acid having the ability to recognize 
a sequence and a minor-groove binder can be provided by Fmoc 
solid-phase synthesis for a Py-Im polyamide. Thus, 
compounds (3-7) and (6) can also be prepared by Fmoc solid- 
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phase synthesis as shown in Synthesis Examples 8 and 10. 
Alkylation of the 727-bp DNA fragment (SEQ ID NO: 14) by 
compound (3-7) was studied. As a result, efficient 
alkylation of sites 1 and 2 based on a hairpin mode of 
recognition and of site 3 based on a linear-chain mode of 
recognition by compound (3-7) was observed at nanomolar 
concentrations (Fig. 7). Interestingly, a heterodimer was 
formed in the presence of compound (6) (500 nM) , which was a 
minor groove binder, and alkylation (site 4) based on 
accurate 10-base-pair recognition was observed. It was 
found that the synergetic effect of two molecules of 
compounds (3-7) and (6) resulted in the ability to recognize 
a long base sequence. This finding is important in view of 
the expansion of the recognition by the Py-Im polyamide. 
DNA Alkylation Targeting Telomeric Sequence 

A continuous base sequence, i.e., 5 1 -GGGTTA-3 1 , is 
present at each terminus of DNA in a cell. It is suggested 
that the base sequence is greatly associated with growth and 
replication of cell. Alkylation of the DNA fragment (446 
bp; SEQ ID NO : 1 7 ) containing 24 repeats of a telomeric 
sequence by compound (4-3) was studied. Efficient 
telomer ic-sequence-specif ic alkylation ( 5 1 -ACCCTA-3 1 ) in a 
hairpin mode of recognition by compound (4-3) was observed 
at nanomolar concentrations (Fig. 8). An alkylating agent 
containing the Py-Im polyamide having an indole linker has a 
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high degree of flexibility in molecular design, thereby 
possibly recognizing various target base sequence. 
Anticancer Activity in vitro by Hairpin Polyamide 

To examine the activity of the Py-Im polyamide (3-2) 
containing the indole linker, which showed the most 
satisfactory sequence-specific alkylation, evaluation was 
made on 39 cultured human cancer cell lines, such as breast 
cancer cell line and prostate cancer cell line (Fig. 9). 
The axis of ordinates of the bar graph shows the 39 cultured 
human cancer cell lines. The value "0" of the abscissa axis 
refers to the mean log IC 5 o value. A longer bar extending 
rightward means that the compound has a higher effect on the 
corresponding cancer. In contrast, a longer bar extending 
leftward means that the compound has a lower effect on the 
corresponding cancer. Compound (3-2) has relatively strong 
activity against cultured human cancer cells (mean log IC 5 o 
value of 39 cell lines is about -7 (100 nM) ) and thus shows 
interesting anticancer activity. Evaluation of the 
anticancer activity on the 39 cultured human cancer cell 
lines from breast cancer cell line to prostate cancer cell 
line was made by screening test performed in The Japanese 
Foundation for Cancer Research. 

HCT-116 (human colon cancer-derived cell), HeLa (human 
cervical cancer-derived cell), HLC-2 (human lung cancer- 
derived cell), and SH-SY-5Y (human neuroblast-der ived cell) 
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were treated with AcImlmPyPy-y-ImPy-Indole-CBI (3-2) (10~ 5 to 
10" 8 M with 0.1% DMF) for 48 hours to evaluate the effect of 
the compound as an anticancer agent. As a result, 50% cell 
growth inhibitory concentration (IC 50 ) against HCT-116, HeLa, 
HLC-2, and SH-SY-5Y were 7.42 x 10~ 8 , 5.97 x 10~ 8 , 5.35 x 10" £ 
and 7.43 x 10~ 9 M, respectively. Similarly, evaluation of 
293T (human kidney-derived normal cell) and WI-38 (human 
normal fibroblast) were made, and IC 50 was 6.99 x 10~ 8 and 
6.79 x 10" 8 M, respectively. The results indicated that the 
anticancer activity of the compound (3-2) against the 
cancerous cell lines was about 10 times that against the 
normal cell lines (IC 50 values were calculated with reference 
to a system as a control treated with 0.1% DMF for 48 hours) 

Evaluation of anticancer activity was made by the 
following process: HCT-116, HeLa, HLC-2, and SH-SY-5Y cell 
lines were seeded into a 96-well flat bottom plate in 
complete media [RPMI1640 (HCT-116, SH-SY-5Y; Sigma-Aldr ich 
Co.) each containing 10% fetal bovine serum and Dulbecco's 
modified minimal media (HeLa, HLC-2, 293T, WI-38; Sigma- 
Aldrich Co.)] at cell densities of 4.0 x 10 3 , 3.6 x 10 3 , 1.6 
x 10 3 , 7 . 0 x 10 2 , 5 . 0 x 10 2 , and 8.0 x 10 2 cells/well and were 
precultured for 24 hours in a CO2 incubator. The media were 
replaced with complete media containing 0.1% DMF and 10~ 5 to 
10" 8 M AcImlmPyPy-y-ImPy-Indole-CBI (3-2) . Incubation was 
performed for 48 hours in the CO2 incubator. Then, 10 



|u,L./well of a Cell Counting Kit-8 (Dojindo Laboratories) was 
added. The specimens were left standing for 2 hours in the 
CO2 incubator. Absorbance was measured with a microplate 
reader MPR-A4I (Tosoh Corp.). 

Anticancer Activity in vivo of Hairpin Polyamide 

Antitumor activity of compound (4-2) against 
transplantable human ER(+) breast cancer Br-10 in nude mice 
was studied. Human ER(+) breast cancer Br-10 was 
subcutaneously transplanted in dorsal portions of 8-week-old 
BALB/cA-nu/nu mice through a trocar. Injection of 0.1 mL of 
EP Hormone Depot was intramuscularly administered in Br-10- 
transplant mice simultaneously with the transplantation. 
When the tumor volume reached about 100 mm 3 , the mice were 
divided into 5 mice/group, and then the administration 
started . 

Compound (4-2) was dissolved in ethanol, diluted with 
physiological saline, and subcutaneously injected in divided 
doses into 3 or 4 portions near the tumor once a day and 3 
times every 3 days. The dose of compound (4-2) was 50 mg/kg 
(about 1 mg/mouse) in a volume of 0.05 mL/10 g. 
Physiological saline was subcutaneously injected in the 
control group. The length of the major axis (L) and the 
length of the minor axis (W) were measured every 3 days. 
The tumor volume was calculated with the following formula: 
Tumor volume (mm 3 ) = L (mm) x W 2 (mm 2 ) /2 
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Comparison of the tumor volume between the compound (4-2) 
treated group and the control group was performed. Table 1 
shows the results. Values shown in the table show relative 
estimation of each group when the tumor volume after 1 day 
in each group is defined as 1.0; and relative estimation 
calculated with (tumor volume of compound (4-2) treated 
group: T) / (tumor volume of control group: C) x 100%. 





after 
1 day 


after 
4 days 


after 
7 days 


after 
10 days 


after 
13 days 


after 
16 days 


control 
group 


1.0 


1.4 


1.7 


2.1 


2.7 


3.1 


compound 
(4-2) 


1.0 


1.1 


1.4 


1.4 


1.5 


2.0 


T/C (%) 


100 


78.6 


82.4 


66.7 


55.6 


64.5 



From Table 1, in the group in which subcutaneous 
injection of compound (4-2) was performed, the suppression 
of tumor growth was observed after the first administration. 
The subsequent administration showed further significant 
suppression of the tumor growth. In this experiment, 
toxicity that causes the weight loss of mice and the like 
was not observed. 
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